Using default inheritance to describe LTAG

نویسندگان

  • Roger Evans
  • Gerald Gazdar
  • David J. Weir
چکیده

We present the results of an investigation into how the set of elementary trees of a Lexicalized Tree Adjoining Grammar can be represented in the lexical knowledge representation language DATR (Evans & Gazdar 1989a,b). The LTAG under consideration is based on the one described in Abeille et al. (1990). Our approach is similar to that of Vijay-Shanker & Schabes (1992) in that we formulate an inheritance hierarchy that efficiently encodes the elementary trees. However, rather than creating a new representation formalism for this task, we employ techniques of established utility in other lexically-oriented frameworks. In particular, we show how DATR's default mechanism can be used to eliminate the need for a non-immediate dominance relation in the descriptions of the surface LTAG entries. This allows us to embed the tree structures in the feature theory in a manner reminiscent of HPSG subcategorisation frames, and hence express lexical rules as relations over feature structures. Vijay-Shanker & Schabes (1992) have drawn attention to the considerable redundancy inherent in LTAG lexicons that are expressed in a flat manner with no sharing of structure or properties across the elementary trees 1. In addition to theoretical considerations, one practical consequence of such redundancy is that maintenance of the lexicon becomes very difficult. One way to minimise redundancy is to adopt a hierarchical lexicon structure with inheritance and lexical rules. Vijay-Shanker & Schabes outline such a view of an LTAG lexicon which is loosely based on that of Flickinger (1987) but tailored for LTAG trees rather than HPSG subcategorization lists. We share their perception of the problem and agree that adopting a hierarchical approach provides the best available solution to it. However, we see no need for the creation of a hierarchical lexical formalism that is specific to the LTAG problem. The use of hierarchical lexicons to reduce or eliminate lexical redundancy is now a fairly well researched area of NLP (Daelemans & Gazdar 1992; Briscoe et al. 1993) and a variety of formal languages for defining such lexicons already exist. One of the more widely known and used of these languages is DATR (Evans & Gazdar 1989a,b); in this paper we will show how DATR can be used to formulate a compact, hierarchical encoding of an LTAG lexicon. There are three major advantages to using an " off the shelf " lexical knowledge representation language (LKRL) like DATR. The first is that it makes it easier …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Encoding Lexicalized Tree Adjoining Grammars with a Nonmonotonic Inheritance Hierarchy

This paper shows how DATR, a widely used formal language for lexical knowledge representation, can be used to define an LTAG lexicon as an inheritance hierarchy with internal lexical rules. A bottom-up featural encoding is used for LTAG trees and this allows lexical rules to be implemented as covariation constraints within feature structures. Such an approach eliminates the considerable redunda...

متن کامل

In Proceedings ACL - 95 1 Encoding Lexicalized Tree Adjoining

This paper shows how DATR, a widely used formal language for lexical knowledge representation , can be used to deene an LTAG lexicon as an inheritance hierarchy with internal lexical rules. A bottom-up featural encoding is used for LTAG trees and this allows lexical rules to be implemented as co-variation constraints within feature structures. Such an approach eliminates the considerable redund...

متن کامل

The Use of Default Unification in a System of Lexical Types

In this paper we describe the encoding of a Unification-Based Generalised Categorial Grammar for English, in terms of a default inheritance network of types, implemented with YADU, which is an order independent default unification operation on typed feature structures. We then propose to use this framework to encode a Universal Grammar (UG) and associated parameters, following the Principles an...

متن کامل

Structure Sharing in Lexicalized Tree-Adjoining Grammars

We present a scheme for efficiently representing a lexicaiized tree-adjoining grammar (LTAG). The propcoed representational scheme allows for structure-sharing between lexical entries and the trees associated with the lexical items. A compact organization is achieved by organizing the lexicon in a hierarchical fashion and using inheritance as well as by using lexical and syntactic rules. While ...

متن کامل

A Lexicalized Tree Ad- Joining Grammar for English. a Lexicalized Tree Adjoining Grammar for English. Automatic Acquisition of Datr Theories from Observations. Theories Des Lexicons: 6 Comparison with Related Work 5 Applying Lexical Rules

This paper shows how DATR, a widely used formal language for lexical knowledge representation, can be used to de ne an LTAG lexicon as an inheritance hierarchy with internal lexical rules. A bottom-up featural encoding is used for LTAG trees and this allows lexical rules to be implemented as covariation constraints within feature structures. Such an approach eliminates the considerable redundan...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره cmp-lg/9501001  شماره 

صفحات  -

تاریخ انتشار 1994